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Abstract 

Given a hierarchical plan (or schedule) with uncertain task times, we may 
need to determine the probability that a given plan will satisfy a given deadline. 
This problem is shown to be NP-hard for series-parallel hierarchies. We pro¬ 
vide a polynomial-time approximation algorithm for it. Computing the expected 
makespan of an hierarchical plan is also shown to be NP-hard. We examine the 
approximation bounds empirically and demonstrate where our scheme is superior 
to sampling and to exact computation. 


1 Introduction 

Numerous planning tools produce plans that call for executing tasks non-linearly. Usu¬ 
ally, such plans are represented as a tree, where the leaves indicate primitive tasks, and 
other nodes represent compound tasks consisting of executing their sub-tasks either in 
parallel (also called “concurrent” tasks mi) or in sequence. (9]|22l H] El- 

Given such a hierarchical plan representation, it is frequently of interest to evalu¬ 
ate its desirability in terms of resource consumption, such as fuel, cost, or time. The 
answer to such questions can be used to decide which of a set of plans, all valid as 
far as achieving the goal(s) are concerned, is better given a user-specified utility func¬ 
tion. Another reason to compute these distributions is to support runtime monitoring 
of resources, generating alerts to the execution software or human operator if resource 
consumption in practice has a high probability of surpassing a given threshold. 

While most tools aim at good average performance of the plan, in which case one 
may ignore the full distribution and consider only the expected resource consump¬ 
tion a , our paper focuses on providing guarantees for the probability of meeting dead¬ 
lines. This type of analysis is needed, e.g., in Service-Level-Agreements (SLA) where 
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guarantees of the form: “response time less than lmSec in at least 95% of the cases” 
are common a. 

We assume that an hierarchical plan is given in the form of a tree, with uncertain 
resource consumption of the primitive actions in the network, provided as a probability 
distribution. The problem is to compute a property of interest of the distribution for the 
entire task network. In this paper, we focus mainly on the issue of computing the prob¬ 
ability P(t < T ) of satisfying a deadline T (i.e. that the makespan t of the plan is less 
than a given value). Since in the above-mentioned applications for these computations, 
one needs results in real-time (for monitoring) or multiple such computations (in com¬ 
paring candidate plans), efficient computation here is crucial, and is more important 
than in, e.g., off-line planning. 

We show that computing P(t < T) is NP-hard even for the simple sum of inde¬ 
pendent random variables (r.v.s), the first contribution of this paper. A deterministic 
polynomial-time approximation scheme for this problem is proposed, the second con¬ 
tribution of this paper. Error bounds are analyzed and are shown to be tight. For 
discrete r.v.s with finite support, finding the distribution of the maximum can be done 
in low-order polynomial time. However, when compounded with errors generated due 
to approximation in subtrees, handling this case requires careful analysis of the re¬ 
sulting error. The approximations developed for both sequence and parallel nodes are 
combined into an overall algorithm for task trees, with an analysis of the resulting 
error bounds, yielding a polynomial-time (additive error) approximation scheme for 
computing the probability of satisfying a deadline for the complete network, another 
contribution of this paper. 

We also briefly consider computing expected makespan. Since for discrete r.v.s, in 
parallel nodes one can compute an exact distribution efficiently, it is easy to compute 
an expected makespan in this case as well as for sequence nodes. Despite that, we show 
that for trees with both parallel and sequence nodes, computing the expected makespan 
is NP-hard. 

Experiments are provided in order to examine the quality of approximation in prac¬ 
tice when compared to the theoretical error bounds. A simple sampling scheme is also 
provided as a yardstick, even though the sampling does not come with error guarantees, 
but only bounds in probability. Finally, we examine our results in light of related work 
in the fields of planning and scheduling, as well as probabilistic reasoning. 

2 Problem statement 

We are given a hierarchical plan represented as a task tree consisting of three types 
of nodes: primitive actions as leaves, sequence nodes, and parallel nodes. Primitive 
action nodes contain distributions over their resource consumption. Although any other 
resource can be represented, we will assume henceforth, in order to be more concrete, 
that the only resource of interest is time. A sequence node v s denotes a task that 
has been decomposed into subtasks, represented by the children of v s , which must be 
executed in sequence in order to execute v s . We assume that a subtask of v s begins 
as soon as its predecessor in the sequence terminates. Task v s terminates when its last 
subtask terminates. A parallel node v p also denotes a decomposed task, but subtasks 
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begin execution in parallel immediately when task v p begins execution; v p terminates 
as soon as all of the children of v p terminate. 

Resource consumption is uncertain, and described as probability distributions in 
the leaf nodes. We assume that the distributions are independent (but not necessarily 
identically distributed). We also assume initially that the r.v.s are discrete and have 
finite support (i.e. number of values for which the probability is non-zero). As the 
resource of interest is assumed to be completion time, let each leaf node v have a 
completion-time distribution P v , in some cases represented as a cumulative distribution 
function form (CDF) F v . 

The main computational problem tackled in this paper is: Given a task tree r and 
a deadline T, compute the probability that r satisfies the deadline T (i.e. terminate 
in time t < T). We show that this problem is NP-hard and provide an approximation 
algorithm. The above deadline problem reflects a step utility function: a constant pos¬ 
itive utility U for all t less than or equal to a deadline time T, and 0 for all t > T. We 
also briefly consider a linear utility function, requiring computation of the expected 
completion time of r and show that this expectation problem is also NP-hard. 

3 Sequence nodes 

Since the makespan of a sequence node is the sum of durations of its components, 
the deadline problem on sequence nodes entails computation of (part of) the CDF of 
a sum of r.v.s which is an NP-hard problem (shown in Section [6J. Thus, there is a 
need for an approximation algorithm for sequence nodes. The main ingredient in our 
approximation scheme is the Trim operator specified as follows: 

Definition 1 (The Trim operator). For a discrete r.v. X and a parameter e > 0, 
consider the sequence of elements in the support of X defined recursively by: x\ = 
min support (X) and, if the set {x > Xj: Pr{xi < X < x) > e} is not empty, 
Xi+i = min (a; > Xi : Pr{xi < X < x) > e}. Let l be the length of this sequence, 
i.e., let l be the first index for which the set above is empty. For notational convenience, 
define xi+\ = oo. Now, define X' = Trim(X, e) to be the random variable specified 
by: 



For example, if X is a r.v. such that Pr(X=l)=0.1, Pr(X= 2)=0.1 and Pr(X= 4)=0.8, 
the r.v. X’= Trim(X, 0.5) is given by Pr(X'= 1)=0.2 and P7'(X'=4)=0.8. In words, 
the Trim operator removes consecutive domain values whose accumulated probability 
is less than e and adds their probability mass to the element in the support that precedes 
them. 

Using the Trim operator, we are now ready to introduce the main operator of this 
section: 

Definition 2. Let Sequence(Xi,... ,X n ,e) be Trim(Sequence(Xi,..., X„_i, e)+X ni e) 
and Sequence(Xi, e)= Trim(Xi, e). 
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Sequence takes a set of r.v.s and computes a r.v. that represents an approximation 
of their sum by applying the Trim operator after adding each of the variables. The 
parameter e, (see below) specifies the accuracy of approximation. 

The Sequence operator can be implemented by the procedure outlined in Algo¬ 
rithm Q] The algorithm computes the distribution Y^i =o using convolution (the 
ConvolveQ operator in line 3) in a straightforward manner. Computing the convolution 
is itself straightforward and not further discussed here. However, since the support of 
the resulting distribution may be exponential in the number of convolution operations, 
the algorithm must trim the distribution representation to avoid this exponential blow¬ 
up. This decreases the support size, while introducing error. The trick is to keep the 
support size under control, while making sure that the error does not increase beyond 
a desired tolerance. Note that the size of the support can also be decreased by simple 
“binning” schemes, but these do not provide the desired guarantees. In the algorithm, 
the PDF of a r.v. Xi is represented by the list D Xi , which consists of ( x,p') pairs, 
where x £ support(Xi) and p' is the probability Pr{Xi=x), the latter denoted by 
D Xi [a;]. We assume that 1)x, is kept sorted in increasing order of x. 


Algorithm 1: Sequence (Xi ,..., X n , e) 


1 D = ((0,1)) // Dummy random var.: 0 with prob. 1 

2 for i — 1 to n do 

3 D=Convolve(D, D Xi ) 

4 D=Trim (£>, £) 


s return D 

6 Procedure Trim (D,e) 
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D' = Q,p = 0 

do = dprev = min support (D) 

foreach d £ support(D) \ {do} in ascending order do 
if p + D[d\ < e then 
| p = p + D[d] 

else 

Append (d pre „, D{d pre v\ + p) to D' 

dprev = d, p = 0 


Append ( d pre v 

return D' 


, D[d p 


- p) to D' 


We proceed to show that Algorithm Q] indeed approximates the sum of the r.v.s, 
and analyze its accuracy/efficiency trade-off. A notion of approximation relevant to 
deadlines is: 

Definition 3. For r.v.s X', X, and e £ [0,1] we write X' X if 0 < Pr{X' < 
T) - Pr(X < T) < efor all T > 0. 

Note that this definition is asymmetric, because, as shown below, our algorithm 
never underestimates the exact value. For the proof of the Lemma[I|below, we establish 
the following technical claim (can be proven by induction on n): 
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Claim 1. Let (ai)" =1 and (6j)™ =1 be sequences of real numbers such that at > 0 
for all 1 < k < n and b\ > b 2 > • • ■ > b n > 0 , then af>i > 0 

We now bound the approximation error of sums of r.v.s: 

Lemma 1. For discrete r.v.s Xi, Xf X 2 , Xf, and £\,£ 2 £ [0,1], « £l Xi and 

X! 2 « £2 X 2 , then X[ + X’ 2 « £l+£2 + X 2 . 

Proof. Pr(X[ + A'j < T) - Pr(XT + X 2 <T) 

T T 

= Y.P^ X i=j) Pr(X' 2 <T-j) -]T Pr(Xi=j)Pr(X 2 <T—j) 

- 7-1 <Pr(X 2 <T-j)+E 2 J = 1 

T T 

<^(Pr(X^i)-Pr(X 1 =i)) Pr(.Y 2 <T-j) + £ 2 ^Pr(X[=j) 

J=1 e [0,1] 3=1 

T T 

<J2( p r( x 'i=j) ~ Pr(Xi=j)) +£2 J2 Pr(X[=j) < £1 + £ 2 . 

3 =1 i=i 

<ei e[o,i] 

Finally, we show that the difference is also nonnegative: 

Pr(X[ + X 2 < T) - Pr(Xi + X 2 < T) 

T T 

= ^>r(X[=j)Pr(X^<P—j)— Pr{X\ =j)Pr(X 2 <T—j) 

3 = 1 3=1 

T 

= YfPr{X[=j) - Pr(Xi=j))Pr(X 2 <T—j) 

3=1 T 

+ J2 Pr ( X 'i=0)(Pr( X 2<T-j) - Pr(X 2 <T-j)) 

3 = 1 

The first term here is non-negative by ClaimQ] the second is nonnegative because it is 
a sum of nonnegative numbers. □ 

We now show that Trim(X, s) is an e-approximation of X: 

Lemma 2. Trim(X, e) ~ £ X 

Proof. Let X’ = Trim(X, e). Let x\< • • • <x m be the support of X' and let l = 
max{i: Xi <T}. We have, 

Pr(X' = = Pr(xi < X < x i+ f) (1) 

because, after Trim, the probabilities of elements that were removed from the support 
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are assigned to the element that precedes them. From Equation <(T]) we get: 

Pr(X' <T)~ Pr{X < T) 
i -1 

= ^ ~ 2 (Pr(X'=Xi ) - Pr(xi < X < x i+ i)) 

+ (Pr(X'=xi)-Pr(xi<X <T)) 
=Pr{X'=xi)-{Pr{xi<X<xi + 1 )-Pr{T<X<xi +1 )) 

=Pr(T < X < xi+ 1) G ( 0 ,e] 

The inequality Pr(T < X < xi- |_i) < e follows from the observation that, for all i, 
Pr{xi < X < Xi- |_i) < e, because p is never greater than e in AlgorithmQ] □ 

To bound the amount of memory needed for our approximation algorithm, the next 
lemma bounds the size of the support of the trimmed r.v.: 

Lemma 3. | support(Trim(X, e))| < 1/e 

Proof. Let X' = Trim(X, e) and let x\ < • • • < x m be the support of X'. And, 
for notational convenience, let x m +i = oo. Let pi = ^2 X . <X<X+1 Px(X=x). Then, 

1 = E£Li Pr{X'=Xi ) = Pr(X=x 1 ) +Ya=i (Pi + Pr(X=x i+1 )) +p m . According 
to algorithmlH lines 11-12, Pi < e and pi + Pr(X=Xi+ 1 ) > e for all 0 < * < to. 
Therefore, 1 = YLIlLi Pr{X'=Xi) > Pr(X'=x\) + (to — 1) • e + p m - Using the fact 
Pr{X’=x i) > 0, we get: (to — 1) • e < 1 => m < 1/e + 1 =» to < 1/e. □ 

These lemmas highlight the main idea behind our approximation algorithm: the 
Trim operator trades off approximation error for a reduced size of the support. The fact 
that this trade-off is linear allows us to get a linear approximation error in polynomial 
time, as shown below: 

Theorem 1. IfX ' Xifor all i G {1,..., n} and X = Sequence(X[ 1 ..., X ' n , e) 
then X « e EILi E. where e = E"=i £ i + n£ - 

Proof, (outline) For n iterations, from LemmaQ] we get an accumulated error of e\ + 
From Lemma[2j we get an additional error of at most ne due to trimming. □ 

Theorem 2. Assuming that m < 1/e, the procedure Sequence(X [,... ,X' n ,e) can 
be computed in time 0((nm/e) log (m/e)) using 0(m/e) memory, where m is the size 
of the largest support of any of the X's. 

Proof. From Lemma[3j the size of list D in AlgorithmQ]is at most m/e just after the 
convolution, after which it is trimmed, so the space complexity is 0{m/e). Convolve 
thus takes time 0({m/e) log(?n/£)), where the logarithmic factor is required internally 
for sorting. Since the runtime of the Trim operator is linear, and the outer loop iterates 
n times, the overall run-time of the algorithm is 0{{nm/e) log {m/e)). □ 
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Example 1. The error bound provided in Theorem\I]is tight, i.e. Sequence(Xi,... ,X n ,e/n ) 
may results in error e: Let 0<£<1 and nCN such that \—e>e/n. Consider, for very 
small 6 > 0, the r.v. X\ defined by: 


Pr(X i=x)= 


£ / (n( 1 — S) x ) 

X ° Z^x= 1 n(l-5)* 


x = 0, 

x S {1, • • •, n}, 
x = n + 1, 
otherwise 


and, for i = 2,..., n, let the r.v.s Xi be such that Pr(Xi=0)=l—S, Pr(Xi=n 2 )=S, 
and zero otherwise. 


4 Parallel nodes 


Unlike sequence composition, the deadline problem for parallel composition is easy 
to compute, since the execution time of a parallel composition is the maximum of the 
durations: 

n n 

Pr( max Xi < T)=Pr( Ax,< T)= TT Pr(Xi < T) (2) 

*6[l:n] ' \ x , 

2=1 z= 1 


where the last equality follows from independence of the r.v.s. We denote the construc¬ 
tion of the CDF using Equation (0 by Parallel(Xi,... ,X n ). If the r.v.s are all 
discrete with finite support, Parallel(Xi,..., X n ) incurs linear space, and compu¬ 
tation time 0(nmlog(n)). 

If the task tree consists only of parallel nodes, one can compute the exact CDF, 
with the same overall runtime. However, when the task tree contain both sequence and 
parallel nodes we may get only approximate CDFs as input, and now the above straight¬ 
forward computation can compound the errors. When the input CDFs are themselves 
approximations, we bound the resulting error: 

Lemma 4. For discrete r.v.s X[,... X' n , Xi,..., X n , if for all i = 1,.... n, X[ 

Xi and 0 < £j < f or some K > 0, then, for any e > £j, we have: 

max, 6 [ 1:n ] X[ « e max, 6 [ 1:n ] where e = Yn=i £ i + e / K - 

Proof. Pr{ max i€[1:n ] X[ < T)-Pr{ max lG[1:n ] X z < T ) 


n n 

<ri( p r(Xi <T)+ Ei)-Pr{Xi < T ) 

i= 1 i =1 


< ]^[(l + Ei) 
i= 1 


< 1 + ^ 



- 1 


<J2 ei + Y, nk£k ^J2 £i + Tz 


1 — ns 


sum of a geo. series 


< ^3 £i + £ /k 

i=l 


Since Pr{X[ < T) > Pr(Xi < T) for each i, this expression is nonnegative. □ 
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5 Task trees: mixed sequence/parallel 


Given a task tree r and a accuracy requirement 0 < e < 1, we generate a distribution 
for a r.v. X' T approximating the true duration distribution X T for the task tree. We 
introduce the algorithm and prove that the algorithm indeed returns an e-approximation 
of the completion time of the plan. For a node v, let t v be the sub tree with v as root 
and let child v be the set of children of v. We use the notation |r| to denote the total 
number of nodes in r. 


Algorithm 2: Network(r, e) 
t Let v be the root of r // Hence, t v = r 

2 n v = \child v \ 

3 if v is a Primitive node then 

4 |_ return the distribution of v 

5 if v is a Sequence node then 

6 for c G child v do 

7 X c = Network (r c , 

8 return Sequence ({X c } cechUdv , ^-y) 

9 if v is a Parallel node then 
to for c G child v do 

11 L Xc = NetWOrk ^ min (T?ff ’ n„(|r„K + l) )) 

12 return Parallel i{X c } cechi id v ) 


Algorithm 2, that implements the operator Network, is a straightforward pos¬ 
torder traversal of the task tree. The only remaining issue is handling the error, in an 
amortized approach, as seen in the proof of the following theorem. 

Theorem 3. Given a task tree t, let X T be a r.v. representing the true distribution of 
the completion time for the network. Then Network(r, e) ss £ X T . 

Proof. By induction on |r|. Base: |t| = 1, the node must be primitive, and Network 
will just return the distribution unchanged which is obviously an e-approximation 
of itself. Suppose the claim is true for 1 < |r| < n. Let r be a task tree of 
size n and let v be the root of r. If v is a Sequence node, by the induction hy¬ 
pothesis that X c ?ej r ^ : | £/ /j Tii | X Tc , and by Theorem|T] the maximum accumulated er- 
ror is J2 c echild v l r d e /W + e /l T d = (n - 1)e/|t„| + e/|r„| = e for v, therefore, 
Sequence({.Y c } c6c /,iu„,e/fi) X T as required. If v is a Parallel node, by the in¬ 
duction hypothesis that X c « ec X Tc , where e c = min(^, n„( | r„|n„+i) ) so E c g c hiid v e c < 
YLcdchiid v < e - e/\r v \. Then, by Lemmad using K = \t v \ and n = n v , we get 
that Parallel({A' c } cec / li ; d „) w E X T as required. □ 

Theorem 4. Let N be the size of the task tree r, and M the size of the maximal support 
of each of the primitive tasks. IfO < e < jfTjftpp) an ‘d M < N/e, the Network ap- 
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proximation algorithm runs in time 0{{N 3 / e 2 )\og{N 3 / e 2 )), using 0(N 3 /e 2 ) mem¬ 
ory. 

Proof. The run-time and space bounds can be derived from the bounds on Sequence 
and on Parallel, as follows. In the Network algorithm, the trim accuracy param¬ 
eter is less than or equal to e/N. The support size (called m in Theorem|2]( of the vari¬ 
ables input to Sequence are 0(N 2 /e). Therefore, the complexity of the Sequence 
algorithm is 0((N 4 /e 2 ) log {N 3 /e 2 )) and the complexity of the Parallel operator 
is 0({N 3 /e) log(AT)). The time and space for sequence dominate, so the total time 
complexity is N times the complexity of Sequence and the space complexity is that 
of Sequence. □ 

If the constraining assumptions on M and e in Theorem[4]are lifted, the complexity 
is still polynomial: replace one instance of 1/e by max(m, 1/e), and the other by 
max( 1/e, N(N 2 + 1)) in the runtime complexity expression. 

6 Complexity results 

We show that the deadline problem is NP-hard, even for a task tree consisting only of 
primitive tasks and one sequence node, i.e. linear plans. 

Lemma 5. Let Y = { Y \,..., Y n } he a set of discrete real-valued r.v.s specified by 
probability mass functions with finite supports, T £ Z, and p £ [0,1]. Then, deciding 
whether Pr(fiT™=o Yi < T) > p is NP-Hard. 

Proof. By reduction from SubsetSum l [T2l problemnumberSP13]. Recall that SubsetSum 
is: given a set S = {si,..., s ra } of integers, and integer target value T, is thereasubset 
of S whose sum is exactly T1 Given an instance of SubsetSum , create the two-valued 
r.v.s Yi,... ,Y n with PrfYj = sf) = 1/2 and Pr(Yj = 0) = 1/2. By construction, 
there exists a subset of S summing to T if and only if Pr(fff™_ 0 Yi = T) > 0. 

Suppose that algorithm A(Y, T,p) can decide Pr(]G” =0 Yj < T) > p in polyno¬ 
mial time. Then, since the r.v.s Yj are two-valued uniform r.v.s, the only possible values 
of p are integer multiples of 1/2", and we can to compute p = -PrQ^”_ 0 Yj < T) using 
a binary search on p using n calls to A. To determine whether Pr( K ( = 7’) > 0, 
simply use this scheme twice, since Pr(Y^— 0 Yj = T) > 0 is true if and only if 
Yi<T)< Pr(Zt o X < T + !)■ □ 

Theorem 5. Finding the probability that a task tree satisfies a deadline T is NP-hard. 

Proof. Given a task tree consisting of leaf nodes, all being children of a single sequence 
node, its makespan is the sum of the completion times of the leaves. The theorem 
follows immediately from Lemma 0 □ 

Finally, we consider the linear utility function, i.e. the problem of computing an 
expected makespan of a task network. Note that although for linear plans the deadline 
problem is NP-hard, the expectation problem is trivial because the expectation of the 
sum of r.v.s X, is equal to the sum of the expectations of the X t s. For parallel nodes, 
it is easy to compute the CDF and therefore also easy to compute the expected value. 
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Despite that, for task networks consisting of both sequence nodes and parallel nodes, 
these methods cannot be effectively combined, and in fact, we have: 

Theorem 6. Computing the expected completion time of a task network is NP-hard. 

Proof. By reduction from subset sum. Construct r.v.s (“primitive tasks”) Y r as in the 
proof of Lemma[5] and denote by X the r.v. . \ Y,. Construct one parallel node with 
two children, one being the a sequence node having the completion time distribution 
defined by X, the other being a primitive task that has a completion time Tj with 
probability 1. (We will use more than one such case, which differ only in the value of 
Tj, hence the subscript j). Denote by Mj the r.v. that represents the completion time 
distribution of the parallel node, using this construction, with the respective Tj. Now 
consider computing the expectation of the Mj for the following cases: T\ = T + 1/2 
and X 2 = T + 1/4. Thus we have, for j £ {1,2}, by construction and the definition of 
expectation: 

E[Mi] = TjPr(X < Tf + ^ x Pr(X = x) 

x>T j 

= TjPr(X <T)+ J 2 x Pr ( x = x ) 

x>T +1 

where the second equality follows from the Y t all being integer-valued r.v.s (and there¬ 
fore X is also integer valued). Subtracting these expectations, we have E[Mi\ — 

E[Mf\ = \P{X < T). Therefore, using the computed expected values, we can 
compute P(X < T), and thus also P(X = T), in polynomial time. □ 

7 Empirical Evaluation 

We examine our approximation bounds in practice, and compare the results to exact 
computation of the CDF and to a simple stochastic sampling scheme. Three types of 
task trees are used in this evaluation: task trees used as execution plans for the (blinded 
for review) team entry in the DARPA robotics challenge (DRC simulation phase), lin¬ 
ear plans (seq), and plans for for the Logistics domain (from IPC2 http://ipc.icaps-conference.org/). 
The primitive task distributions were uniform distributions discretized to M values. 

In the Logistics domain, packages are to be transported by trucks or airplanes. 
Hierarchical plans were generated by the JSHOP2 planner lf2H for this domain and 
consisted of one parallel node (packages delivered in parallel), with children all being 
sequential plans. The duration distribution of all primitive tasks is uniform but the sup¬ 
port parameters were determined by the type of the task, in some tasks the distribution 
is fixed (such as for load and unload) and in others the distribution depends on the 
velocity of the vehicle and on the distance to be travelled. 

After running our approximation algorithm we also ran a variant that uses an in¬ 
verted version of the Trim operator, providing a lower bound of the CDF, as well as 
the upper bound generated by Algorithmic Running both variants allows us to bound 
the actual error, costing only a doubling of the run-time. Despite the fact that our error 
bound is theoretically tight, in practice and with actual distributions, according to Ta- 
bleQ] the resulting error in the algorithm was usually much better than the theoretical 
e bound. 
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Task Tree 

N 

M 

Approx, alg., e 

Sampling,# samples 

0.1 

0.01 

10 4 

10 b 

Drive(DRC) 

47 

2 

[-0.005,0.009] 

[-0.0004, 0.0004] 

0.0072 

0.0009 

47 

4 

[-0.01, 0.02] 

[-0.0009, 0.001] 

0.0075 

0.0011 

47 

10 

[-0.01, 0.03] 

[-0.001,0.003] 

0.0083 

0.0015 

Seq 

10 

4 

[-0.03, 0.04] 

[-0.003,0.004] 

0.008 

0.0016 

10 

10 

[-0.03, 0.06] 

[0.003, 0.007] 

0.0117 

0.001 

Logistics 

45 

4 

[-0.004,0.004] 

[-0.0004,0.0004] 

0.008 

0.0006 

45 

10 

[-0.005,0.006] 

[-0.0004,0.0006] 

0.013 

0.001 


Table 


: Estimation errors 


Task Tree 

N 

M 

Exact 

Approx., e 

Sampling, # samples 

0.1 

0.01 

10 4 

10 b 

Drive (DRC) 

47 

2 

1.49 

0.141 

1.14 

1.92 

190.4 

47 

4 

18.9 

0.34 

7.91 

2.1 

211.5 

47 

10 

> 2h 

1.036 

32.94 

2.81 

279.1 

Seq 

10 

4 

0.23 

0.003 

0.02 

0.545 

54.22 

10 

10 

10.22 

0.008 

0.073 

0.724 

72.4 

Logistics 

45 

4 

373.3 

0.2 

7 

2.5 

256 

45 

10 

> 4h 

2.19 

120 

3.12 

314 


Table 2: Runtime comparison (run times in seconds) 


We ran the exact algorithm, our approximation algorithm withe £ {0.1,0.01,0.001}, 
and a simple simulation with 10 3 to 10 7 samples, on networks from the DRC imple¬ 
mentation, sequence nodes with 10,20, and 50 children, and 20 Logistics domain plans, 
and several values of M. Results for a typical indicative subset (regretfully reduced due 
to space limits) are shown in tables [I] (error comparison) and [2] (runtime comparison). 
The exact algorithm times out in some cases. Both our approximation algorithm and 
the sampling algorithm handle all these cases, as our algorithm’s runtime is polynomial 
in N, M, and 1/e as is the sampling algorithm’s (time linear in number of samples). 

The advantage of the approximation algorithm is mainly in providing bounds with 
certainty as opposed to the bounds in-probability provided by sampling. Additionally, 
as predicted by theory, accuracy of the approximation algorithm improves linearly with 
1/e (and almost linear in runtime), whereas accuracy of sampling improves only as a 
square root of the number of samples. Thus, even in cases where sampling initially 
outperformed the approximation algorithm, increasing the required accuracy for both 
algorithms, eventually the approximation algorithm overtook the sampling algorithm. 


8 Discussion 

Numerous issues remain unresolved, briefly discussed below. Trivial improvements 
to the Trim operator are possible, such as the inverse version of the operator used 
to generate a lower bound for the empirical results. Other candidate improvements 
are not performing trimming (or even stopping a trimming operation) if the current 
support size is below 1/e, which may increase accuracy but also the runtime. Another 
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point is that in the combined algorithm, space and time complexity can be reduced by 
adding some Trim operations, especially after processing a parallel node, which is 
not done in our version. This may reduce accuracy, a trade-off yet to be examined. 
Another option is, when given a specific threshold, trying for higher accuracy in just 
the region of the threshold, but how to do that is non-trivial. For sampling schemes 
such methods are known, including adaptive sampling El ED, stratified sampling, and 
other schemes. It may be possible to apply such schemes to deterministic algorithms 
as well - an interesting issue for future work. 

Extension to continuous distributions: our algorithm can handle them by pre-running 
a version of the Trim operator on the primitive task distribution. Since one cannot it¬ 
erate over support values in a continuous distribution, start with the smallest support 
value (even if it is — oo), and find the value at which the CDF increases by e. This 
requires access to the inverse of the CDF, which is available, either exactly or approxi¬ 
mately, for many types of distributions. 

We showed that the expectation problem is also NP-hard. A natural question is on 
approximation algorithms for the expectation problem, but the answer here is not so 
obvious. Sampling algorithms may run into trouble if the target distribution contains 
major outliers, i.e. values very far from other values but with extremely low probabil¬ 
ity. Our approximation algorithm can also be used as-is to estimate the CDF and then 
to approximate the expectation, but we do not expect it to perform well because our 
current Trim operator only limits the amount of probability mass moved at each loca¬ 
tion to e, but does not limit the “distance” over which it is moved. The latter may be 
arbitrarily bad for estimating the expectation. Possibly adding simple binning schemes 
to the Trim operator in addition to limiting the moved probability mass to e may work, 
another issue for future research. 

Related work on computing makespan distributions includes lfl6l . which examines 
sum of Bernoulli distributed r.v.s. Other work examines both deterministic l20l and 
Monte-Carlo techniques @[T9j. Distribution of maximum of r.v.s was studied in (8], 
with a focus mostly on continuous distributions. 

Complexity of finding the probability that the makespan is under a given threshold 
in task networks was shown to be NP-hard in lfl4l . even when the completion time of 
each task has a Bernoulli distribution. Nevertheless, our results are orthogonal as the 
source of the complexity in 01 is in the graph structure, whereas in our setting the 
complexity is due to the size of the support. In fact for linear plans (an NP-hard case 
in our setting), the probability of meeting the deadline can be computed in low-order 
polynomial time for Bernoulli distributions, using straightforward dynamic program¬ 
ming. Makespan distributions in series parallel networks in the i.i.d. case was exam¬ 
ined in Ifl3l , without considering algorithmic issues. There is also a significant body of 
work on estimating the makespan of plans and schedules nsmiam, within a context 
of a planner or schedueler. The analysis in these papers is based on averaging or on 
limit theorems, and does not provide a guaranteed approximation scheme. 

Copmuting the distribution of the makespan in trees is a seemingly trivial prob¬ 
lem in probabilistic reasoning j23l . Given the task network, it is straightforward to 
represent the distribution using a Bayes network (BN) that has one node per task, and 
where the children of a node v in the task networks are represented by BN nodes that 
are parents of the BN node representing v. This results in a tree-shaped BN, where 
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it is well known that probabilistic reasoning can be done in time linear in the number 
of nodes, e.g. by belief propagation (message passing) gang. The difficulty is in 
the potentially exponential size of variable domains, which our algorithm, essentially a 
limited form of approximate belief propagation, avoids by trimming. 

Looking at makespan distribution computation as probabilistic reasoning leads to 
interesting issues for future research, such as how to handle task completion times that 
have dependencies, represented as a BN. Since reasoning in BNs is NP-hard even for 
binary-valued variables 00, this is unlikely in general. But for cases where the BN 
toplogy is tractable, such as for BNs with bounded treewidth El , or directed-path singly 
connected BNs l24l . a deterministic polynomial-time approximation scheme for the 
makespan distribution may be achievable. The research literature contains numerous 
randomized approximation schemes that handle depenencies l23l [25l . especially for 
the case with no evidence. In fact out original implementation of the sampling scheme 
in our DARPA robotics challenge entry handled dependent durations. It is unclear 
whether such sampling schemes can be adapted to handle dependencies and arbitrary 
evidence, such as: “the completion time of compound task X in the network is known 
to be exactly 1 hour from now”. Finally, one might consider additional commonly 
used utility functions, such as a “soft” deadline: the utility is a constant U before the 
deadline T, decreasing linearly to 0 until T + G for some “grace” duration G, and 0 
thereafter. 
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